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fSj ' Abstract 



We propose a very simple preconditioning method for integer programming feasibility problems: 
replacing the problem 

b' < Ax <b 

X e Z" 

with 

b' < {AU)v < b 

y e Z", 

^ ' where C/ is a unimodular matrix computed via basis reduction, to make the columns of AU short 

C^^ , (i.e., have small Euclidean norm), and nearly orthogonal (see e.g., [13], [H]). Our approach is 

termed column basis reduction, and the reformulation is called rangespace reformulation. It 
is motivated by the technique proposed for equality constrained IPs by Aardal, Hurkens and 
Lenstra. We also propose a simplified method to compute their reformulation. 

We also study a family of IP instances, called decomposable knapsack problems (DKPs). 

DKPs generalize the instances proposed by Jeroslow, Chvatal and Todd, Avis, Aardal and 

f^ \ Lenstra, and Cornuejols et al. They are knapsack problems with a constraint vector of the form 

pM + r, with p > and r integral vectors, and M a large integer. If the parameters are suitably 

chosen in DKPs, we prove 



• 



hardness results, when branch- and-bound branching on individual variables is applied; 



5t 1 • that they are easy, if one branches on the constraint px instead; and 



• 



that branching on the last few variables in either the rangespace or the AHL reformulations 
is equivalent to branching on px in the original problem. 

We also provide recipes to generate such instances. 

Our computational study confirms that the behavior of the studied instances in practice is 
as predicted by the theory. 
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1 Introduction and overview of the main results 



Basis reduction Basis reduction (BR for short) is a fundamental technique in computational 
number theory, cryptography, and integer programming. If ^ is a real matrix with m rows, and n 
independent columns, the lattice generated by the columns of A is 

h{A) = {^x|a;GZ"}. (1.1) 

The columns of A are called a basis of L(^). A square, integral matrix U is unimodular if det U = 
±1. Given A as above, BR computes a unimodular U such that the columns of AU are "short" 
and "nearly" orthogonal. The following example illustrates the action of BR: 

'289 18' 
^ = I 466 29 I , [/ = I ^ „ /,^\,AU 
273 17^ 

We have L(^) = I^{AU). In fact for two matrices A and B, L(yl) = L(i3) holds, if and only if 
B = AU for some U unimodular matrix (see e.g. Corollary 4.3a, [33j). 

In this work we use two BR methods. The first is the Lenstra, Lenstra, and Lovasz (LLL for 
short) reduction algorithm [26] which runs in polynomial time for rational lattices. The second is 
Korkhine-Zolotarev (KZ for short) reduction - see [23] and [32] - which runs in polynomial time 
for rational lattices only when the number of columns of A is fixed. 



Basis reduction in Integer Programming The first application of BR for integer program- 
ming is in Lenstra's IP algorithm that runs in polynomial time in fixed dimension, see [27]- Later 
IP algorithms which share polynomiality for a fixed number of variables also relied on BR: see, for 
instance Kannan's algorithm [25]; Barvinok's algorithm to count the number of lattice points in 
fixed dimension [HI [18], and its variant proposed by de Loera et al. in [28]. A related method in 
integer programming is generalized basis reduction due to Lovasz and Scarf [30] . For its implemen- 
tation see Cook et. al in [13]. Mehrotra and Li in [31] proposed a modification and implementation 
of Lenstra's method, and of generalized basis reduction. For surveys, we refer to [33] and |24j . 

A computationally powerful reformulation technique based on BR was proposed by Aardal, 
Hurkens, and Lenstra in [2j. They reformulate an equality constrained IP feasibility problem 



(1.2) 



with integral data, and A having m independent rows, as follows: they find a matrix B, and a 
vector Xf, with [S,^?,] having short, and nearly orthogonal columns, xi, satisfying Axb = b, and the 
property 

{xGZ"|Ax = 0} = {5A|AgZ"-'"}. (1.3) 



Ax 


= 


b 


i< 


X 


<u 




X 


eZ' 



The reformulated instance is 



(■ — Xh < BX < u — Xb 

A G Z"-™. 



(1.4) 



For several families of hard IPs, the reformulation (jl.4p turned out to be much easier to solve 
for commercial MIP solvers than the original one; a notable family was the marketshare problems 
of Cornuejols and Dawande |15j . The solution of these instances using the above reformulation 
technique is described by Aardal, Bixby, Hurkens, Lenstra, and Smeltink in [l]. 

The matrix B and the vector Xb are found as follows. Assume that A has m independent rows. 
They embed A and 6 in a matrix, say D, with n + m + 1 rows, and n + 1 columns, with some entries 
depending on two large constants A^i and A'^2: 



-'n Uji 



xl 



D = I Oixn A^l 

^N2A -N2bj 



:i.5) 



The lattice generated by D looks like 

L(D) = < 



X 

NiXQ 
[ \ N2iAx - bxo) 



7n+l 



Xq 



>, 



:i.6) 



in particular, all vectors in a reduced basis of L(Z)) have this form. 



For instance, if j4 = [2, 2, 2], and 6 = 3, (this corresponds to the infeasible IP 2xi + 2x2 + 2x3 
3, Xi £ {0, 1}, when the bounds on x are and e), then 



(X 
Nixo 
iV2(2a;i+2x2 + 2a;3-32;o) 



X 



Xo 



el? xl} 



(1.7) 



It is shown in [2], that if A'^2 ^^ -^1 ^^ 1 are suitably chosen, then in a reduced basis of L(L') 



n — m vectors arise from some with Ax = bxo, xq = 0, and 



1 vector will arise from an with Ax = bxQ, xq = 1. 



So the X vectors from the first group can form the columns of B, and the x from the last can serve 
as Xb- If LLL- or KZ-reduction (the precise definition is given later) is used to compute the reduced 
basis of L(D), then B is a basis reduced in the same sense. 

Followup papers on this reformulation technique were written by Louveaux and Wolsey [29], 
and Aardal and Lenstra [3", 1] . 



Questions to address The speedups obtained by the Aardal-Hurkens-Lenstra (AHL) reformu- 
lation lead to the following questions: 



(Ql) Is there a similarly effective reformulation technique for general (not equality constrained) 
IPs? 

(Q2) Why does the reformulation work? Can we analyse its action on a reasonably wide class of 
difficult IPs? 

More generally, one can ask: 

(Q3) What kind of integer programs are hard for a certain standard approach, such as branch-and- 
bound branching on individual variables, and easily solvable by a different approach? 

As to (Ql), one could simply add slacks to turn inequalities into equalities, and then apply the 
AHL reformulation. This option, however, has not been studied. The mentioned papers emphasize 
the importance of reducing the dimension of the space, and of the full-dimensionality of the refor- 
mulation. Moreover, reformulating an IP with n variables, m dense constraints, and some bounds 
in this way leads to a D matrix (see (jl.Sp ) with n + 2m + 1 rows and n + m + 1 columns. 

A recent paper of Aardal and Lenstra [3], and [1| addressed the second question. They considered 
an equality constrained knapsack problem with unbounded variables 

ax = j3 
X > (KP-EQ) 

X GZ", 

with the constraint vector a decomposing as a = pM + r, with p,r £ Z", p > 0, Ma positive 
integer, under the following assumption: 

Assumption 1. (1) rj/pj = maxj=i,...,„ {ri/pi}, ru/pk = minj=i,...,n {ri/pi}- 

(2) ai < a2 < ■ ■ ■ < a„; 

(3) ELi \r^\ < 2M; 

(4) M>2-r,/pj; 

(5) M>r,j/pj-2rk/pk- 

They proved the following: 

(1) Let Frob(a) denote the Frobenius number of ai, . . . ,an, i.e., the largest (3 integer for which 
dKP-EQP is infeasible. Then 

2 
{M^PjPk + M{pjrk + Pkrj) + rjrk){l - --j— j—) 

Frob(a) > ^""^^-^^ - (M + rJpA. (1.8) 

Pkrj - PjTk 

(2) In the reformulation p.4|) . if we denote the last column of B by bn-i, then 

||6n_i||>^ **"** =. (1.9) 

\/\\p\\ \\r 11^ —ypr^ y 



It is argued in [3] that the large right-hand side explains the hardness of the corresponding instance, 
and that the large norm of bn-i explains why the reformulation is easy: if we branch on 6„_i in 
the reformulation, only a small number of nodes are created in the branch-and-bound tree. In 
section [8] we show that there is a gap in the proof of (jl.Op . Here we also show an instance of a 
bounded polyhedron where the columns of the constraint matrix are LLL-reduced, but branching 
on a variable corresponding to the longest column produces exponentially many nodes. 

Among the other papers that motivated this research, two are "classical": [22J, and parts of 
|12j . They all address the hardness question in (Q3), and the easiness is straightforward to show. 

Jeroslow's knapsack instance in [22] is 

min Xn+i 
•s*- '^Y17=l^i + ^n+l = n (1-10) 

Xi G {0,1} (i = l,...,n + l), 

where n is an odd integer. The optimal solution of (ll.lOp is trivially 1, but branch-and-bound 
requires an exponential number of nodes to prove this, if we branch on individual variables. 

In [12] Todd and Avis constructed knapsack problems of the form 



max ax 
St. ax<(5 (1.11) 

xG {0,1}", 

with a decomposing as a = eM + r [M and r are chosen differently in the Todd- and in the 
Avis-problems). They showed that these instances exhibit a similar behavior. 

Though this is not mentioned in [22], or [12], it is straightforward to see that the Jeroslow-, 
Todd-, and Avis-problems can be solved at the rootnode, if one branches on the constraint X^iLi -^^ 
instead of branching on the Xj . 

A more recent work that motivated us is [T6|. Here a family of instances of the form 

max ax 

s.t. ax<l3 (1.12) 

xeZ!J:, 

with a decomposing as a = pM + r, where p and r are integral vectors, and M is a positive 
integer, was proposed. The authors used Frob(a) as P in (J1.12p . These problems turned out to 
be hard for commercial MIP solvers, but easy if one uses a test-set approach. One can also verify 
computationally that if one branches on the constraint px in these instances, then feeds the resulting 
subproblems to a commercial solver, they are solved quite quickly. 



Contributions, and organization of the paper We first fix basic terminology. When branch- 
and-bound (B&B for short) branches on individual variables, we call the resulting algorithm ordi- 
nary branch-and-bound. 



Definition 1. Ifp is an integral vector, and k an integer, then the logical expression px < k \/ px > 
k + 1 is called a split disjunction. We say that the infeasibility of an integer programming problem 
is proven by px < k V px > k + 1, if both polyhedra {x\px < /c } and {x|px>A; + l} have empty 
intersection with the feasible set of its LP relaxation. 

We say that the infeasibility of an integer programming problem is proven by branching on px, 
if px is nonintegral for all x in its LP relaxation. 

We call a knapsack problem with weight vector a a decomposable knapsack problem (DKP for 
short), if a = pM + r, where p and r are integral vectors, p > 0, and M is a large integer. We could 
not find a good definition of DKPs which would not be either too restrictive, or too permissive, as 
far as how large M should be. However, we will show how to find M and the bounds for given p 
and r so the resulting DKP has interesting properties. 

The paper focuses on the interplay of these concepts, and their connection to IP reformulation 
techniques. 

(1) In the rest of this section we describe a simple reformulation technique, called the rangespace 
reformulation for arbitrary integer programs. The dimension of the reformulated instance 
is the same as of the original. We also show a simplified method to compute the AHL 
reformulation, and illustrate how the reformulations work on some simple instances. 

For a convenient overview of the paper we state Theorems [1] and [2] as a sample of the main 
results. 

(2) In Section [2] we consider knapsack feasibility problems with a positive weight vector. We 
show a somewhat surprising result: if the infeasibility of such a problem is proven by px < 
ky px > k-\-l, with p positive, then a lower bound follows on the number of nodes that must 
be enumerated by ordinary B&B to prove infeasibility. So, easiness for constraint branching 
implies hardness for ordinary B&B. 

(3) In Section [3] we give two recipes to find DKPs, whose infeasibility is proven by the split 
disjunction px < k y px > k + 1. Split disjunctions for deriving cutting planes have been 
studied e.g. in [HI HH EJ [7]. This paper seems to be the first systematic study of knapsack 
problems with their infeasibility having such a short certificate. 

Thus (depending on the parameters), their hardness for ordinary B&B follows using the results 
of Section [2j We show that several well-known hard integer programs from the literature, 
such as the Jeroslow-problem [22j, and the Todd- and Avis-problems from [12j can be found 
using Recipe 1. Recipe 2 generates instances of type ( KP-EQD , with a short proof (a split 
disjunction) of their infeasibility. 

So this section provides a unifying framework to show the hardness of instances (for ordinary 
B&B) which are easy for constraint branching. These results add to the understanding of 
hard knapsacks described in [22], [12] and [3j, as follows. We deal with arbitrary knapsacks, 
both with bounded, and unbounded variables; we give explicit lower bounds on the number 
of nodes that ordinary B&B must enumerate, which is done in [12] for the Todd and Avis 
instances; and our instances have a short, split disjunction certificate. 

Using the recipes we generate some new, interesting examples. For instance. Example [8] is 
a knapsack problem whose infeasibility is proven by a single split disjunction, but ordinary 
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B&B needing a superexponential number of nodes to prove the same. Example [5] reverses the 
role of the two vectors in the Avis-problem, and gives an instance which is computationally 
more difficult than the original. 

(4) In Section Owe extend the lower bound (jl.Sp in two directions. We first show that for given 
p and r integral vectors, and sufficiently large M, there is a range of f3 integers for which the 
infeasibility of ( |KP-EQ| ) with a = pM + r is proven by branching on px. The smallest such 
integer is essentially the same as the lower bound in (jl.Sp . 

Any such /3 right-hand side is a lower bound on Frob(a), with a short certificate of being a 
lower bound, i.e. a split disjunction certificate of the infeasibility of ( |KP-EQ] ). 

We then study the largest integer for which the infeasibility of ( |KP-EQ] ) with a = pM + r, 
and M sufficiently large, is proven by branching on px. We call this number the p-branching 
Frobenius number, and give a lower and an upper bound on it. 

(5) In Section[5]we show some basic results on the geometry of the reformulations. Namely, given 
a vector say c, we find a vector which achieves the same width in the reformulation, as c does 
in the original problem. 

(6) Subsection 16.11 shows why DKPs become easy after the rangespace reformulation is applied. 
In Theorem [10] we prove that if M is sufficiently large, and the infeasibility of a DKP is 
proven by branching on px, then the infeasibility of the reformulated problem is proven by 
branching on the last few variables in the reformulation. How many "few" is will depend on 
the magnitude of M. We give a similar analysis for the AHL reformulation in Subsection 



Here we remark that a method which explicitly extracts "dominant" directions in an integer 
program was proposed by Cornuejols at al in in [IGj . 

(7) In Section [7] we present a computational study that compares the performance of an MIP 
solver before and after the application of the reformulations on certain DKP classes. 

(8) In Section [8] we point out a gap in the proof of (jl.Op . and show a correction. We also describe 
a bounded polyhedron with the columns of the constraint matrix forming an LLL-reduced 
basis, where branching on a variable corresponding to the longest column creates exponentially 
many subproblems. 



The rangespace reformulation Given 

b' < Ax <b 

X £ Z", 



(IP) 



we compute a unimodular (i.e. integral, with ±1 determinant) matrix U that makes the columns 
of AU short, and nearly orthogonal; U is computed using basis reduction, either the LLL- or the 
KZ-variant (our analysis will be unified). We then recast (lIPp as 

b' < iAU)y < b p 

y G Z". 

The dimension of the problem is unchanged; we will call this technique rangespace reformulation. 





xi Ui 



\ 



Figure 1: The polyhedron in Example [T] before and after reformulation 



Example 1. Consider the infeasible problem 



106 < 21x1 + 19x2 < 113 

< xi,X2 < 6 

Xi,X2 G Z, 



(1.13) 



with the feasible set of the LP-relaxation depicted on the first picture in Figure [TJ In a sense it is 
both hard, and easy. On the one hand, branching on either variable will produce at least 5 feasible 
nodes. On the other hand, the maximum and the minimum of xi + X2 over the LP relaxation of 
(J1.13p are 5.94, and 5.04, respectively, thus "branching" on this constraint proves infeasibility at 
the root node. 

When the rangespace reformulation with LLL-reduction is applied, we have 




^=1 0,C/= \,AU 



(1.14) 



so the reformulation is 

106 < -2yi + 7y2 < 113 

< -yi - 6^2 < 6 

< yi + 7y2 < 6 

yi,y2 e ^■ 

Branching on y2 immediately proves infeasibility, as the second picture in Figure [1] shows. The 
linear constraints of ()1.14p imply 

5.04 <y2< 5.94. (1.15) 

These bounds are the same as the bounds on xi + X2- This fact will follow from Theorem [71 a 
general result about how the widths are related along certain directions in the original and the 
reformulated problems. 



Example 2. This example is a simplification of Jeroslow's problem (jl.lOp from [22]. Let n be a 
positive odd integer. The problem 

< X < e (1.16) 

is integer infeasible. 

Ordinary B&B (i.e. B&B branching on the Xj variables) must enumerate at least 2^""^^ " nodes 
to prove infeasibility. To see this, suppose that at most {n — l)/2 variables are fixed to either or 
1. The sum of the coefficients of these variables is at most n — 1, while the sum of the coefficients 
of the free variables is at least n + 1. Thus, we can set some free variable(s) to a possibly fractional 
value to get an LP-feasible solution. 

On the other hand, denoting by e the vector of all ones, the maximum and minimum of ex over 
the LP relaxation of (J1.16P is n/2, thus branching on ex proves infeasibility at the root node. 

For the rangespace reformulation using LLL-reduction, we have 




0(n-l)xl\ ,^^| Q 




(1.17) 



(n-l)xl ' 



thus the reformulation is 

2yn = n 

0< 2/i,...,yn-i <1 

o< -T.l=ly^ + yn <i 

So branching on y„ immediately implies the infeasibility of (ll.lTp . and thus of (I1.16p . 



A simplified method to compute the AHL reformulation Rangespace reformulation only 
affects the constraint matrix, so it can be applied unchanged, if some of the two-sided inequalities 
in pP|) are actually equalities, as in Example [2j We can still choose a different way of reformulating 
the problem. Suppose that 

Aix = hi (1.18) 

is a system of equalities contained in the constraints of pPh . and assume that Ai has mi rows. 
First compute an integral matrix i?i, 

{ X E Z" I Aix = } = { BiA I A e Z"~"i }, 

and an integral vector xi with Axi = bi. Bi and xi can be found by a Hermite Normal Form 
computation - see e.g. [33], page 48. 

In general, the columns of [Bi, xi] will not be reduced. So, we substitute i?iA + xi into the part 
of (jIPp excluding (J1.18p . and apply the rangespace reformulation to the resulting system. 

If the system (jl.lSp contains all the constraints of the integer program other than the bounds, 
then this way we get the AHL reformulation. 
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Example 3. (Example [2] continued) In this example (J1.16p has no solution over the integers, 
irrespective of the bounds. 

However, we can rewrite it as 

2 YIi=i Xi + Xn+i = n 

< Xi-n < e , , 

- - (1.19) 

-1/2 <x„+i < 1/2 

The X integer vectors that satisfy the first equation in (J1.19p can be parametrized with A G Z" as 

Xl = Xl-\ h An 

X2 = -Ai 

: (1.20) 

^n — ^n— 1 

Xn+1 = -2Xn + n. 

Substituting (ll.20p into the bounds of ()1.19p we obtain the reformulation 

0< E"=iA,+A„ <1 

0< -A,- <l(j = l,...,n-l) 

-1/2 < -2An + n < 1/2 

AgZ". 

The columns of the constraint matrix of p.2ip are already reduced in the LLL-sense. The last 
constraint is equivalent to 

(n + l)/2 - 3/4 < A„ < (n + l)/2 - 1/4, (1.22) 

so the infeasibility of (jl.2ip and thus of (I1.19P is proven by branching on A^. 



Right-hand side reduction On several instances we found that reducing the right-hand side in 
ppp yields an even better reformulation. To do this, we rewrite ppp as 



Fx < f , , 

- ■' IP2 

a; G Z", ' ^ 

then reformulate the latter as 

{FU)y <f-{FU)xr p 

y GZ", 

where the unimodular U is again computed by basis reduction, and Xr G Z" to make / — {FU)xr 
short, and near orthogonal to the columns of FU. For the latter task, we may use ~ for instance 
- Babai's algorithm [5j to find Xr, so that {FU)xr is a nearly closest vector to / in the lattice 
generated by the columns of F. 

It is worth to do this, if the original constraint matrix, and right-hand side (rhs) both have large 
numbers. Since the rangespace reformulation reduces the matrix coefficients, leaving large numbers 
in the rhs may lead to numerical instability. Our analysis, however, will rely only on the reduction 
of the constraint matrix. 

11 



Rangespace, and AHL reformulation To discuss the connection of these techniques, we as- 
sume for simplicity that right-hand-side reduction is not apphed. 

Suppose that A is an integral matrix with m independent rows, and b is an integral column 
vector with m components. Then the equality constrained IP 

Ax = b 

i< X <u (1.23) 

has another, natural formulation: 

i< BX + x, <u 

X e Z"-"^, 

where 

{xGZ"|Ax = 0} = {5A|AgZ"-"^}, (1.25) 

and Xf, satisfies Axh = b. The matrix B can be constructed from A using an HNF computation. 
Clearly, to (|1.23p we can apply 

• the rangespace reformulation (whether the constraints are inequalities, or equalities), or 

• the AHL method, which is equivalent to applying the rangespace reformulation to (jl.24p . 

So, on (jl.23p the rangespace reformulation method can be viewed as a "primal" and the AHL 
reformulation as a "dual" method. The somewhat surprising fact is, that for a fairly large class of 
problems both work, both theoretically, and computationally. When both methods are applicable, 
we did not find a significant difference in their performance on the tested problem instances. 

An advantage of the rangespace reformulation is its simplicity. For instance, there is a one- 
to-one correspondence between "thin" branching directions in the original, and the reformulated 
problems, so in this sense the geometry of the feasible set is preserved. The correspondence is 
described in Theorem [7| in Section [5j The situation is more complicated for the AHL method, 
and correspondence results are described in Theorems [8] and [H These results use ideas from, and 
generalize Theorem 4.1 in [31j . 

Li a sense the AHL method can be used to simulate the rangespace method on an inequality 
constrained problem: we can simply add slacks beforehand. However: 

• the rangespace reformulation can be applied to an equality constrained problem as well, where 
there are no slacks; 

• the main point of our paper is not simply presenting a reformulation technique, but analysing 
it. The analysis must be carried out separately for the rangespace and AHL reformulations. 
In particular, the bounds on M that ensure that branching on the "backbone" constraint 
px in ()KPp will be mimicked by branching on a small number of individual variables in the 
reformulation will be smaller in the case of rangespace reformulation. 
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Using the rangespace reformulation is also natural when dealing with an optimization problem 
of the form 

max ex 

s.t. b' < Ax < b (IP-OPT) 

X £ IT-. 

Of course, we can reduce solving pP-OPTp to a sequence of feasibility problems. 

A simpler method is solving pP-OPTp by direct reformulation, i.e. by solving 

max cy 

St. b' < Ay < b (IP^T) 

y G Z", 

where 

c = cU,A = AU, 

with U having been computed to make the columns of 




reduced. 



Some other reformulation methods Among early references, the all-integral simplex algo- 
rithm of Gomory [19] can be viewed as a reformulation method. Bradley in pilj studied integer 
programs connected via unimodular transformations, akin to how the rangespace reformulation 
works. However, the transformations in [10] do not arise from basis reduction. 



The Integral Basis Method [2T] has two reformulation steps: in the first an integral basis of an 
IP from a nonintegral basis of the LP relaxation is found. In the second, an augmentation vector 
leading to a better integral solution is found, or shown not to exist. Haus in his dissertation f20] 
studied the question of how to derive such augmentation vectors for general IPs. 



Notation Vectors are denoted by lower case letters. In notation we do not distinguish between 
row and column vectors; the distinction will be clear from the context. Occasionally, we write {x, y) 
for the inner product of vectors x and y. 

We denote the sets of nonnegative, and positive integers by Z^_, and Z_|_^, respectively. The 
sets of nonnegative, and positive integral n- vectors are denoted by Z" , and Z"^, respectively. If 
n a positive integer, then A^ is the set {1, . . . ,n}. If S" is a subset of A^, and v an n- vector, then 
v{S) is defined as 'Yli^s'^i- 

For a matrix A we use a Matlab-like notation, and denote its jth row, and column by Aj^- and A-j, 
respectively. Also, we denote the subvector (a^, . . . , a^) of a vector a by a^.^. 

For p £ ^+-1-, and an integer k we write 

i{p,k) = max{i\p{F)<k,andp{N\F)>k + iyFON,\F\=i}. (1.26) 
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The definition impUes that i{p, k) = if A; < 0, or A; > ^^pj, and i{p, k) is large if the components 
of p are small relative to k, and not too different from each other. For example, if p = e, k < n/2, 
then i{p, k) = k. 

Sometimes £{p, k) is not easy to compute exactly, but we can use a good lower bound, which 
is usually easy to find. For instance, let n be an integer divisible by 4, p = (1, 2, . . . , n). The first 
3n/4 components of p sum to strictly more than (X]"=iPj)/2, and the last n/4 sum to strictly less 
than this. Since the components of p are ordered increasingly, it follows that 

£(p,n(n + l)/4) > n/4. 

On the other hand, l{p, k) can be zero, even if k is positive. For example, if p is superincreasing, 
i.e. Pi > pi + ■ ■ ■ + pi-i for i = 2, . . . , n, then it is easy to see that i{p, k) = for any positive 
integer k. 



Knapsack problems We will study knapsack feasibility problems 

Pi < ax < (32 
< X <u (KP) 

X £ Z". 

In the rest of the paper for the data of (jKPp we will use the following assumptions, that we collect 
here for convenience: 

Assumption 2. The row vectors a,u are in Z" + . We allow some or all components of u to be 
+00. If Ui = +CXD, and a > 0, then we define aui = +cxd, and ifb€z Z" + , is a row vector, then we 
define bu = +oo. We will assume < /3i < /32 < au. 

Recall the definition of a decomposable knapsack problem from Definition [TJ For the data 
vectors p and r from which we construct a we will occasionally (but not always) assume 

Assumption 3. p G ^++) r gZ"", p is not a multiple of r, and 

ri/pi <■■■ < rn/pn- (1-27) 



and 



he problems (I1.13J) and (jl.lfip 


p = 


( 1, 1), 


r = 


( 1,-1), 


u = 


( 6,6), 


M = 


20, 


a = 


pM + r = (21, 19), 


p 


= e, 


r 


= 0, 


u 


= e, 


M 


= 2, 


a 


= pM + r = 2e, 
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respectively. 



Width and Integer width 

Definition 2. Given a polyhedron Q, and an integral vector c, the width and the integer width of 
Q in the direction of c are 

width(c, Q) = max {cx\x£Q} — min { cj; | x G Q}, 
iwidth(c, Q) = [max {cx\x £ Q}\ — [min {cx\x £ Q}~\ + 1. 



If an integer programming problem is labeled by (P), and c is an integral vector, then with some 
abuse of notation we denote by width.{c, (P)) the width of the LP-relaxation of (P) in the direction 
c, and the meaning o/iwidth(c, (P)) is similar. 

The quantity iwidth(c, Q) is the number of nodes generated by B&B when branching on the 
constraint ex. 



Basis Reduction Recall the definition of a lattice generated by the columns of a rational matrix 
A from (II. 1|) . Suppose 

B=[bi,...,bn], (1.28) 

with bi G Z™. Due to the nature of our application, we will generally have n < m. While most 
results in the literature are stated for full-dimensional lattices, it is easy to see that they actually 
apply to the general case. Let 6J, . . . , 6* be the Gram-Schmidt orthogonalization of 6i, . . . , 6„, that 
is 

i 

(1.29) 



(1.30) 



(1.31) 

||/i,,_i6*_i + 6:f > IwbUf- (1-32) 

An LLL-reduced basis can be computed in polynomial time for varying n. 
Define the truncated sums 

i 

bi{k) = Y^ fi,jb* {l<k<i<n), (1.33) 

j=k 

and for i = 1, . . . ,n let Lj be the lattice generated by 

bi{i),bi+i{i),...,bnii). 

15 









bi = y ]l^ijb], 


with 


I^ii = 1 

^i^j = bfb*/\ 


\b*\ 


(i = 1,... ,n), 
IP (i = 1,... ,n; j = l,...,i 


We call 6i , . . 


. . ,bn LLL-reduced if 




\^ii^ < i^{l<j<i<n), 



We call bi, . . . ,bn Korkhine-Zolotarev reduced (KZ-reduced for short) if bi{i) is the shortest lattice 
vector in Li for all i. Since Li = L and 61 (1) = bi, in a KZ-reduced basis the first vector is the 
shortest vector of L. Computing the shortest vector in a lattice is expected to be hard, though it 
is not known to be NP-hard. It can be done in polynomial time when the dimension is fixed, and 
so can be computing a KZ-reduced basis. 

Definition 3. Given a BR method (for instance LLL, or KZ), suppose there is a constant Cn 
dependent only on n with the following property: for all full- dimensional lattices L(y4) in Z", and 
for all reduced bases { 5i, . . . , 6„ } of L(A), 

max{ II 61 II, . . . , II 6j II } < c„ max{ II di II, . . . , II dj II } (1-34) 

for all i < n, and any choice of linearly independent di, . . . ,di £ L(j4). We will then call Cn the 
reduction factor of the BR method. 

The reduction factors of LLL- and KZ-reduction are 2^""^)/^ (see p6]) and y/n (see [32]), 
respectively. For KZ-reduced bases, j32] gives a better bound, which depends on i, but for simplicity, 
we use -y/n. 

The kth. successive minimum of the lattice L(A) is 

Afc(L(S)) = min { 1 1 3 A; linearly independent vectors in L,{A) with norm at most t }. 
So (|1.34p can be rephrased as 

max{ II 61 II, . . . , II 6j II } < c„Aj(L(A)) for all i < n. (1.35) 

Otlier notation Given an integral matrix C with independent rows, the null lattice, or kernel 
lattice of C is 

N(C) = {veZ''\Cv = 0}. (L36) 

For vectors /, p, and u we write 

uiayi{f,p,i,u) = m.ax{ fx\px < £, < X < u}, 
min(/, p,i,u) = min {fx\px>i,0<x<u}. 

Theorems [T] and [2] below give a sample of our results from the following sections. The overall 
results of the paper are more detailed, but Theorems 1 and 2 are a convenient sample to first look 
at. 

Theorem 1. Let p E Z"^_, r S Z", and k and M integers with 

< k<j:i,p 

M > 2^(||r|| +1)2 IIpII +1. 
Then there are Pi, and ^2 integers that satisfy 

kM + ^/^\\r\\< pi </?2 < -\/n ||r|| +{k + l)M, (1.39) 

and for all such (/?i , P2) the problem IIKP\} with a = pM + r and u = e has the following properties: 
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(1) Its infeasibility is proven by px < k\/ px > k + 1. 

(2) Ordinary B&B needs at least 2^^^'^' nodes to prove its infeasibility regardless of the order in 
which the branching variables are chosen. (Recall the definition ofi(p,k) from ^.26\) ). 

(3) The infeasibility of its rangespace reformulation computed with KZ-reduction is proven at the 
rootnode by branching on the last variable. 

Theorem 2. Let p, and r be integral vectors satisfying Assumption\^ k and M integers with 

k > 0, , , 

~ ^ „ „ (1.40) 

M > m.ayi{krn/pn — kri/pi — ri/pi + 1, 2^/n \\r\\\\p\\}. 

Then there exists a (3 integer such that 

k{M + rn/pn) < /3 < ik + l)iM + ri/pi), (1.41) 

d for all such (3 the problem \KP-EQ^ with a = pM + r has the following properties: 



an 



(1) Its infeasibility is proven by px < k\/ px > k + 1. 

(2) Ordinary B&B needs at least 

([k/ IbllooJ +n-l\ 

nodes to prove its infeasibility, independently of the sequence in which the branching variables 
are chosen. 

(3) The infeasibility of the AHL reformulation computed with KZ-reduction is proven at the rootn- 
ode by branching on the last variable. 



2 Why easiness for constraint branching impUes hardness for or- 
dinary branch-and-bound 

In this section we prove a somewhat surprising result on instances of (jKPp . If the infeasibility is 
proven by branching on px, where p is a positive integral vector, then this implies a lower bound 
on the number of nodes that ordinary B&B must take to prove infeasibility. So in a sense easiness 
implies hardness! 

A node of the branch-and-bound tree is identified by the subset of the variables that are fixed 
there, and by the values that they are fixed to. We call {x,F) a node-fixing, ii F Q N, and x G Z 
with < Xi < UiWi €z F, i.e. a; is a collection of integers corresponding to the components of F. 

Theorem 3. Let p G ^++) '^iT'd k an integer such that the infeasibility of iKP\) is proven by 
px<k\/px>k-\-l. Recall the notation of i{p, k) from U.26\) . 

(1) If u = e, then ordinary B&B needs at least 2^^^'^' nodes to prove the infeasibility of / fJTPj) . 
independently of the sequence in which the branching variables are chosen. 
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(2) If Ui = +cxoVi, then ordinary B&B needs at least 

([k/ IbllooJ +n-l\ 

V --1 J 

nodes to prove the infeasibility of IIKP\) . independently of the sequence in which the branching 
variables are chosen. 



D 

To have a large lower bound on the number of B&B nodes that are necessary to prove infeasi- 
bility, it is sufficient for i{p, k) to be large, which is true, if the components of p are relatively small 
compared to k, and are not too different. That is, we do not need the components of the constraint 
vector a to be small, and not too different, as in Jeroslow's problem. 

First we need a lemma, for which one needs to recall the definition (ll.37p . 
Lemma 1. Let k be an integer with < k < pu. Then (Op and ^ below are equivalent: 

(1) The infeasibility of liKF\j is proven by px < k\/ px > k + 1. 

(2) 

max(a,p, A;,'u) < /3i < /32 < min(a,p, k + l,u). (2.42) 

Furthermore, if (CP holds, then ordinary B&B cannot prune any node with node-fixing {x,F) that 
satisfies 

/ Pi^i < k, and > piUi > A: + 1. (2.43) 

ieF i^F 

Proof Recall that we assume < /3i < /32 < clu. For brevity we will denote the box with upper 
bound u by 

Bu = {x\0<x<u}. (2.44) 

The implication ([2]) =^ ([1]) is trivial. To see ([I]) =^ (l2|) first assume to the contrary that the lower 
inequality in (j2.42p is violated, i.e. there is yi with 

yi G ^«, pyi < k, ayi > pi. (2.45) 

Let xi = 0. Then clearly 

xi £ Bu, pxi < k, axi < (3i. (2.46) 

So a convex combination of xi and yi, say z satisfies 

z £ Bu,pz <k, az = j3i, (2.47) 

a contradiction. Next, assume to the contrary that the upper inequality in ()2.42p is violated, i.e. 
there is 1/2 with 

2/2 G Bu, py2>k + 1, ay2 < /?2. (2.48) 
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Ui 


if i F. 



Define X2 by setting its ith component to lij, if Ui < +cxo, and to some large number a to be 
specified later, if Ui = +oo. If a is large enough, then 

X2 G Bu, px2>k + 1, 0X2 > P2- (2.49) 

Then a convex combination of X2 and 7/2; say w satisfies 

w £ Bu, pw > A; + 1, aw = P2, (2.50) 

a contradiction. So ([1]) =^ ([2]) is proven. 

Let {x, F) be a node- fixing that satisfies (j2.43p . Define x' and x" as 

(2.51) 

If Ui = +00, then x'- = Ui means "set x" to an a sufficiently large number". We have px' < k, so 
ax' < /3i; also, px" > k + 1, so ax" > (32 holds as well. Hence a convex combination of x' and x", 
say z is LP-feasible for (jKPh . Also, Zi = Xi {i £ F) must hold, so the node with node-fixing {x,F) 
is LP-feasible. 

D 

Proof of Theorem [3] Again, we use the notation B^ as in (j2.44p . 

First we show that < k < pu must hold. (The upper bound of course holds trivially, if any 
Ui is -|-oo.) If A; < 0, then px > k + 1 is true for all x G B^, so the infeasibility of (jKPp could not 
be proven hy px < k \/ px > k + 1. Similarly, if A; > pu, then px < k is true for all x G Bu, so the 
infeasibility of ()KPp could not be proven hy px < k\/ px > k + 1. 

For both parts, assume w.l.o.g. that we branch on variables xi,X2,... in this sequence. For 
part (1), let F = {1, 2, . . . ,i{p, k)}. From the definition of £{p, k) it follows that any fixing of the 
variables in F will satisfy (j2.43p . so the corresponding node will be LP-feasible. Since there are 
2^f'> such nodes, the claim follows. 

For part (2), let F = {1, . . . ,n — 1}, and assume that Xi is fixed to Xi for all i £ F. Since all 
UiS are -|-oo, this node- fixing will satisfy (j2.43p if 

Xi > OVi £ F, J^Pi^i ^ k- (2-52) 

We will now give a lower bound on the number of x G Z^ that satisfy (j2.52p . Clearly, (j2.52p holds, 
if 

^x^ < LA;/bl|ooJ (2.53) 

iG-F 

does. It is known (see e.g., |9], page 30) that the number of nonnegative integral (m-i, . . . , rrid) with 
rrii -|- • • • -|- nid < t is 

Cr)- 
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Using this with t = [k/ ||p||ooj; d = n — 1, the number of x G Z^ that satisfy (j2.52p is at least 

([k/ \\p\\oo\ +n-l\ 
[ n-l )• 

and so the number of LP feasible nodes is lower bounded by the same quantity. D 



3 Recipes for decomposable knapsacks 

In this section we give simple recipes to find instances of (jKPp and (KP-EQ) with a decomposable 



structure. The input of the recipes is the p and r vectors, an integer k, and the output is an integer 
M, a vector a with a = pM + r, and the bounds /3i and P2, or /?. The found instances will have 
their infeasibility proven by px < kVpx > k + 1, and if k is suitably chosen, be difficult for ordinary 
B&B by Theorem [3l We will show that several well-known hard integer programming instances 
are found by our recipes. 

The recipes are given in Figure [2] and in Figure O respectively. 



Recipe 1 

Input : Vectors p,u £ Z."^, r E Z", k integer with < k < pu. 
Output: M e Z_^+,a G Z!f^_^, /3i,/32 s.t. a = pM + r, 

and the infeasibility of (|KPp is proven hy px < k y px > k + 1. 

Choose M, /3i, /?2 s.t. pM + r > 0, and 

msix{r,p,k,u) + kM < /3i < /32 < mm{r,p,k + l,u) + {k + 1)M. (3.1) 

Set a = pM + r. 



Figure 2: Recipe 1 to generate DKPs 
Theorem 4. Recipes 1 and 2 are correct. 

Proof Since a = pM + r, 

max(a,p, A;, m) < max(r,p, A;, u) + /sM, (3.3) 

and 

min(a,p, A; + !,«) > min(r,p, A; + l,n) + A;M. (3.4) 

So the output of Recipe 1 satisfies 

max(a,p, /c, u) < (ii < (52 < min(a,p, A; + 1, u), (3.5) 
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Recipe 2 

Input : Vectors p,r G Z" satisfying Assumption [3l k nonnegative integer. 
Output: M,/3 e Z++, a e Z!J:+ s.t. a = pM + r, and 

the infeasibility of ( KP-EQ| ) is proven hy px < k V px > k + 1. 



Choose M,j3 e Z++ s.t. pM + r > 0, and 

0<k{M + rn/pn) < f3 < {k + l){M + n/pi). (3.2) 

Set a = pM + r. 



Figure 3: Recipe 2 to generate instances of (|KP-EQP 



and so the infeasibihty of the resulting DKP is proven by px < k\/ px > k + 1. 

For Recipe 2, note that with the components of u all equal to +oo, we have 

max(r,p, A;, u) = A;r„/p„, and (3-6) 

Tain{r,p,k + l,u) = {k + l)ri/pi, (3-7) 

so Recipe 2 is just a special case of Recipe 1. D 

Example [1] continued We created Example [J using Recipe 1: here pu = 12, so k = 5 has 

< k < pu, and 

max(r, p,k,u) = max { xi — X2 | < xi , X2 < 6, xi + X2 < 5 } = 5, 
min(r, p, k + l,u) = min{ xi — X2 | < xi,X2 < 6, xi + X2 > 6} = —6. 

So (|3.ip becomes 

5 + 5M < /3i < /?2 < -6 + 6M, 
hence M = 20, /3i = 106, (32 = 113 is a possible output of Recipe 1. 

Example [2] continued Example O can also be constructed via Recipe 1: now pu = n, so k = 

(n — l)/2 satisfies < k < pu. Then r = implies 

max(r,p, A;,u) = inm{r,p,k + l,u) = 0, 
so (j3.ip becomes 

^M < /3i < /32 < ^M, 

and M = 2, /?! = /?2 = n- is a possible output of Recipe 1. D 

Example 4. Let n be an odd integer, k = [n/2\ , p = u = e, and r an integral vector with 

n < r2 < • • • < r„. (3.8) 
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Then we claim that any M and /? = /3i = /32 is a possible output of Recipe 1, if 

a = L(i/2)Er=i(^+^0J, 

M + ri > 0, (3.9) 

M + rfc+i > (rfc+2H 'trn)-{ri^ hr^). 

Indeed, this easily follows from 

max(r,_p, A;,u) = rk+2^ l-r„, 

min(r,p, /c + l,u) = n H hrfc+r^+i. 

Two interesting, previously proposed hard knapsack instances can be obtained by picking r, M, 
and /3 that satisfy ^Mi- When 

r = (2^+1 + 1,..., 2^+" + 1), M = 2'^+^+\ (3.10) 

with t. = [log 2nJ , we obtain a feasibility version of a hard knapsack instance proposed by Todd in 
[12]. When 

r = (l,...,n), M = n(n + l), (3.11) 



we obtain a feasibility version of a hard knapsack instance proposed by Avis in [12 
So the instances are 



ax 



1 

2^^ 



n 
a,; 



,xG{o,ir 



with 

a = (2"+^+i + 2^+i + l,...,2"+^+i+2^+" + l), (3.12) 

for the Todd-problem, and 

a = (n(n + l) + l,...,n(n + 1) +n) (3.13) 

for the Avis-problem. 

Example 5. In this example we reverse the role of p and r from Example [H and will call the 
resulting DKP instance a reverse-Avis instance. This example illustrates how we can generate 
provably infeasible and provably hard instances from any p and r; also, the reverse-Avis instance 
will be harder from a practical viewpoint, as explained in Remark [5] below. 

Let n be a positive integer divisible by 4, 

p = (l,...,n), 

r = e, (3.14) 

k = n(n + l)/4. 

Since k = (X^"=iPi)/2, the first 3n/4 components oi p sum to strictly more than fc, and the last 
n/4 sum to strictly less than /c, so 

max(r,p,A;,n) < 3n/4, 
min(r,p. A; + l,n) > n/4. 
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Hence a straightforward computation shows that 

M = n/2 + 2, , , 

' ' (3.16) 

/? = /3i = /32 = 3n/4 + A:(n/2 + 2) + 1 

are a possible output of Recipe 1. 

Corollary 4. Ordinary B&B needs at least l^"-'^'''^ nodes to prove the infeasibility of Jeroslow's 
problem in Example O of the instances in Example [^ including the Avis- and Todd-problems, and 
at least 2"'^ nodes to prove the infeasibility of the reverse- Avis instance. 



Proof We use Part (1) of Theorem[3l In the first three instances n is odd, p = u = e, k = (n— 1)/2, 
so i{p, k) = k. In the reverse- Avis instance we have i{p, k) > n/4 as explained after the definition 
(fr26]l . 

D 

Remark 5. While we can prove a 2^"'^^'''^ lower bound for the Avis and the Todd instances, they 
are easy from a practical viewpoint: it is straightforward to see that a single knapsack cover cut 
proves their infeasibility. 

For the reverse-Avis problem we can prove only a 2"'^ lower bound, but this problem is hard 
even from a practical viewpoint. We chose n = 60, and ran the resulting instance using the CPLEX 
11 MIP solver. After enumerating 10 million nodes the solver could not verify the infeasibility. 

Next we give examples on the use of Recipe 2. 
Example 6. Let n = 2, 

k = 1, 

P = ih 1), 
r = (-11,5). 

Then (13. 2p in Recipe 2 becomes 

0< M + 5 </3<2(M-ll), (3.17) 

hence M = 29, /? = 35 is a possible output of Recipe 2. So the infeasibility of 

18x1 + 34x2 = 35 

Xi,X2 G Z_,_ 

is proven by xi + X2 < 1 V xi + X2 > 2, a fact that is easy to check directly. 



(3.18) 



D 

Example 7. In Recipe 2 M and /3 are constrained only by ri/pi and rn/pn- So, if n = 17, k = 1, 
and 



p = (1,1,..., 1), 

r = (-11,-10,. ..,0,1,. ..5), 
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then M = 29, /5 = 35 is still a possible output of Recipe 2. So the infeasibility of 

18x1 + 19x2 + • • • + 34x2 = 35 

Xi,X2, . . . ,Xi7 > (3.19) 

Xi,X2,...,Xi7 G Z_^ 

is proven by Ya=i ^j < 1 V Ya=i ^« - 2- 



D 

We finally give an example, in which the problem data has polynomial size in n, the infeasibility 
is proven by a split disjunction, but the number of nodes that ordinary B&B must enumerate to 
do the same is a superexponential function of n. 

Example 8. Let n and t be integers, n,t>2. We claim that the infeasibility of 

(n*+^ + l)xi H [- (n*+^ + n)x„ = n^*+^ + n*+^ + 1, 

Xj E Z I (i = 1, . . . ,n) 



(3.20) 



is proven by 

n n 

^Xj < n* V ^Xj > n* + l, (3.21) 



but ordinary B&B needs at least 



^{n-l)(t-l) 



nodes to prove the same. Indeed, 



p = e, 

r = (1,2,... ,n), 

k = n\ (3.22) 

M = n*+\and 

/3 = n2*+i+n*+i + l 

satisfy (j3.2|) . So the fact that the infeasibility is proven by (|3.21|) follows from the correctness of 
Recipe 2. By Part (2) of Theorem [3] ordinary B&B needs to enumerate at least 

(n* + n — l\ 
n-1 ) 

nodes to prove the infeasibility of (j3.2Up . But 

\ n-1 / \n-l \n-lj 
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4 Large right hand sides in ( [KP-EQ] ). The branching Frobenius 
number 



In this section we assume that p and r integral vectors which satisfy Assumption [3] are given, and 
let 

q= {ri/pi,...,rn/pn)- (4.23) 



Recipe 2 returns a vector a = pM + r, and an integral /5, such that the infeasibility of ( KP-EQ| ) 
with this P is proven by branching on px. 



The Frobenius number of a is defined as the largest integer /3 for which (KP-EQ) is infeasible, 
and it is denoted by Frob(a). This section extends the lower bound result (jl.Sp of Aardal and 
Lenstra in [3l H] in two directions. First, using Recipe 2, we show that for sufficiently large M 



there is a range of /3 integers for which the infeasibility of ( KP-EQ ) with a = pM + r is proven by 
branching on px. The smallest such integer is essentially the same as the lower bound in (jl.Sp . 



We will denote 



f{M,6) 



M + qi -6' 

Qn - qi 



(4.24) 



(for simplicity, the dependence on p and r is not shown in this definition) . 



Theorem 5. Suppose that f{M, 1) > (i.e., M > g„ - 2gi + 1), a £ Z^^, with a = pM + r. Then 
there is an integer (3 with 

f{M, 1)(M + g„) < /? < (/(M, 1) + 1)(M + gi), (4.25) 

and for all such (3 integers the infeasibility of (KP-EQ^ is proven by px < f{M, 1) Vpx > /(M, 1) + 1. 



Proof There is an integer (3 satisfying (|3.2|) in Recipe 2, if 

k{M + qn) + l<{k + l) (M + qi) 



(4.26) 



But it is straightforward to see that ()4.26p is equivalent to k < f{M,l). Choosing k = /(M, 1) 
turns (132D into (1051) . D 

Clearly, for all /? right-hand sides found by Recipe 2 

p < Frob(a). (4.27) 

Since for the /5 rhs values found by Recipe 2, the infeasibility of ( KP-EQ| ) has a short, split 



disjunction certificate, and there is no known "easy" method to prove the infeasibility of (KP-EQ) 
with /? equal to Frob(a), such /3 right-hand sides are interesting to study. 

Definition 6. Assume that f{M, 1) > 0, and a is a positive integral vector of the form a = pM + r. 
The p-branching Frobenius number of a is the largest right-hand side for which the infeasibility of 
( KP-EQ^ is proven by branching on px. It is denoted by 



Frobp(a). 
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Theorem 6. Assume that f{M, 1) > 0, and a is a positive integral vector of the form a = pM + r. 
Then 

f{M, 1)(M + (?„) < Frobp(a) < (/(M, 0) + 1) (M + qi) . (4.28) 



Proof The lower bound comes from Theorem [5j Recall the notation ()1.37p . If all components of 
u are +oo, then 

max{a,p,k,u) = kan/pn = k{M + qn), (4.29) 

mm{a,p,k + l,u) = {k + l)ai/pi = {k + 1){M + qi). (4.30) 



So Lemma [T] implies that if the infeasibility of ( KP-EQ| ) is proven hy px < k \/ px > k + 1, then 



k{M + qr,)<P<{k + l){M + qi), (4.31) 

hence 

k{M + qn)<{k + l){M + qi), (4.32) 

which is equivalent to 

^M + qi'^ 



, M + qi , 

k < — <^ k < 

Qn - qi 



In -qi 



^k<f{M,0). (4.33) 



The infeasibility of (KP-EQ ) is proven by branching on px iff it is proven hy px < k V px > k + 1 



for some nonnegative integer k. So, the largest such /? is strictly less than 

{k + l){M + qi), (4.34) 

with k < f{M, 0), so it is strictly less than (/(M, 0) + 1) (M + qi) , as requhed. 

D 
Example [6] continued Recall that in this example 

P = ih 1), 
r = (-11,5), 

so we have gi = —11, q2 = 5. So if M = 29, then /(M, 0) = /(M, 1) = 1, and the bounds in 
Theorem [5] become 

34 < /? < 36. 

Hence Theorem \5\ finds only /3 = 35, as the only integer for which the infeasibility of (|3.18|) is 
proven by branching on xi + 3:2 < 1 V j;i + a;2 > 2. 

Letting a = pM + r = (18, 34), Theorem [6] shows 

34 < Frobp(a) < 36, 

so Frobp(a) = 35. 
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5 The geometry of the original set, and the reformulation 

This section proves some basic results on the geometry of the reformulations using ideas from the 
recent article of Mehrotra and Li [31j . Our goal is to relate the width of a polyhedron to the width 
of its reformulation in a given direction. 

Theorem 7. Let 

Q = {x eW^\Ax<b], 
Q = {y eR''\AUy<b}, 

where U is a unimodular matrix, and c G Z" . 

Then 

max { ex I X G Q } = max { cUy | y G Q } , 
with X* attaining the maximum in Q if and only ifU^^x* attains it in Q. 

(2) 

width(c,(5) = width(ct/,(5). 

(3) 

iwidth(c, Q) = iwidth(cC/, Q). 



Proof Statement ([T]) follows from 

Q = {Uy\yeQ], (5.1) 

and an analogous result holds for "min" . Statements ([2]) and ^ are easy consequences. D 

Theorem [7] immediately implies 

Corollary 7. 

min widthfc, Q) = min widthfd, Q). 
cgZ"\{o} deZ"\{o} 

D 

Theorem 8. Suppose that the integral matrix A has n columns, and m linearly independent rows, 
let S he a polyhedron, and 

Q = {xeR'^lxeS, Ax = b}, 

Q = {A|yA + XbG5, AgM"-™}, 
where V is a basis matrix for N(A), and Xb satisfies Axf, = b. If c £ Z^ is a row vector, then 
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max { ex I X G Q } = cxh + max { cF A | A G Q}, 
with X* attaining the maximum in Q if and only if A* attains it in Q, where x* = VX* + Xf,. 

(2) 

width(c,(5) = width(cy;Q). 

(3) 

iwidth(c, Q) = iwidth(cy, Q). 

Proof Statement ^ follows from 

Q = { FA + Xb I A e Q}. 

An analogous result holds for "min", and statements ([2]) and ([3]) are then straightforward conse- 
quences. 

D 

Theorem [8] can be "reversed". That is, given a row vector d G Z"~™, we can find a row vector 
c^l/^, such that 

max { ex I X S Q } = max { dA | A G Q } + const. 

Looking at ([1]) in Theorem [HI for the given d it suffices to solve 

cV = d,ce Z". (5.2) 

The latter task is trivial, if we have a V* integral matrix such that 

V*V = In-m; (5.3) 

then e = dV* wiU solve (15.21) . To find V*, let W be an integral matrix such that U = [W,V] is 
unimodular; for instance W will do, if 

A[W,V] = [H,0], 

where H is the Hermite Normal Form of A. Then we can choose V* as the submatrix of U~^ 
consisting of the last n — m rows. 

In this way we have proved Theorem [S] and Corollary [51 which are essentially the same as 
Theorem 4.1, and Corollary 4.1 proven by Mehrotra and Li in 31j: 

Theorem 9. (Mehrotra and Li) Let Q, Q, V be as in Theorem\^ and V* a matrix satisfying Ii5.3\) . 
Then 

max { dA I A € Q } = max { dV*x | x € Q } — dV*Xh-, 
with X* attaining the maximum in Q if and only if A* attains it in Q, where x* = VX* + Xf,- 
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(2) 

width(dy*,Q) = width((i,Q). 

(3) 

iwidth(dy*,Q) = iwidth((i,g). 



Corollary 8. (Mehrotra and Li) Let Q,Q,V, and V* he as before. Then 

mill widthfd, Q) = min widthfc, Q). 

rfgZ"-'"\{o} ceL(y*r)\{o} 



D 



D 



6 Why the reformulations make DKPs easy 



This section will assume a decomposable structure on (IKPp and ( KP-EQ] ), that is 



a = pM + r, (6.1) 

with p G 2++, r S Z", and M an integer. We show that for large enough M the phenomenon of 
Examples [T] and [2] must happen, i.e., the originally difficult DKPs will turn into easy ones. 

We recall that for a given a matrix A, we use a Matlab-like notation, and denote its j row, 
and column by Aj^: and A:j, respectively. 

An outline of the results is: 

(1) If M is large enough, and U is the transformation matrix of the rangespace reformulation, 
then pU will have a "small" number of nonzeros. Considering the equivalence between the 
old and new variables Uy = x, this means that branching on just a few variables in the refor- 
mulation will "simulate" branching on the backbone constraint px in the original problem. 
An analogous result will hold for the AHL reformulation. 

(2) It is interesting to look at what happens, when branching on px does not prove infeasibility 
in the original problem, but the width in the direction of p is relatively small - this is the 



case in (KP-EQ) as we prove in Lemma [2] below. 

Invoking the results in Section [5] will prove that when M is sufficiently large, the same, or 
smaller width is achieved along a unit direction in either one of the reformulations. 

Lemma 2. Suppose that Assum.ption\^holds. Then 

width(p, dKP^EQl )) = e{/3/M^), (6.2) 

width(e^,( |KP^EQl )) = e(/3/M) Vi G {1, . . . ,n}. (6.3) 

In both equations the constant depends on p and r. 
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Proof : Since Oj = piM + ri, 

implies 

So 



and therefore 



n/Pl < ••• < rn/Pn 
pi/ai >■■■ >Pn/an- 



inax{px\ax = (3, X > 0} = (3pi/ai, 
in.m{px\ax = (3, X > 0} = (3pn/an, 



width(p, p^P^EQD ) = I3{pi/ai-pn/an) 

= (3{pian-Pnai)/{aia„ 
= /3(pir„ -p„ri)/(aia„ 



(6.4) 
(6.5) 



Also, 



hence 



Since 



max{ Xi\ax = P, X > 0} = fi/ai 
min{ Xj I ax = /5, a; > } = 0, 



width(ei,(KP-EQ)) = f3/a 



ai = e(M) Vie {l,...,n}, 



both (lO) and (ESl) follow 
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6.1 Analysis of the rangespace reformulation 

After the rangespace reformulation is applied, the problem (jKPp becomes 

A < {aU)y < p2 
< Uy <u 

y GZ", 

where the matrix U was computed by a BR algorithm with input 

a\ I pM -\- r\ 



A 



I . 



I 



Let us write 



A = AU, a = all, p = pU, f = rU, 
and fix Cn, the reduction factor of the used BR algorithm. 



(KP-R) 



(6.6) 
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Recall that for a lattice L, Afc(L) is the smallest real number t for which there are k linearly 
independent vectors in L with norm at most t. 

For brevity, we will denote 

ak = AkiN{p))ik = l,...,n-l). (6.7) 

First we need a technical lemma: 
Lemma 3. Let A be as in 116. 6\) . Then 

AkiUA)) < i\\r\\+l)akforke{l,...,n-l}. (6.8) 



Proof We need to show that there are k linearly independent vectors in I^{A) with norm bounded 
by (II r II +l)afc. 

Suppose that wi, . . . ,Wk are linearly independent vectors in N{p) with norm bounded by ak- 
Then Awi, . . . , Aw^ are linearly independent in L(yl), and 

hence 

\\Awi\\< {\\r\\ +1) \\wi\\< {\\r\\ +l)ak {i = l,...,k) 

follows, which proves (j6.8p . 

D 
Theorem 10. The following hold: 

(1) Let k < n — 1, and suppose 

M>c„(||r||+l)2afc. (6.9) 

Then 

Pl:k = 0. (6.10) 

Also, if the infeasibility of IIKP\) is proven by branching onpx, then the infeasibility of IlKP-R]) 
is proven by branching on yk+i, ■ ■ ■ ,yn- 

(2) Suppose 

M>c„(||r||+1)2 IIpII . (6.11) 

Then 

Pl:n~l = 0, (6.12) 

and 

width(e„, dEEEl)) < width(p, dlP])) .g^g. 

iwidth(e„,(JKEEl)) < iwidth(p,(lKP|)) 

In particular, in the rangespace reformulation of ^KP-E(^ the width, and the integer width 
in the direction of e„ are 

G(/3/m2). 
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D 

Before proving Theorem[10l we give some intuition to the validity of (j6.10p . and (j6.12p . Suppose 
M is "large" , compared to \\p\\, and 1 1 r 1 1 . In view of how the matrix A looks in ()6.6p , it is clear that 
its columns are not short, and near orthogonal, due to the presence of the nonzero pi components. 
Thus to make its columns short and nearly orthogonal, the best thing to do is to apply a unimodular 
transformation that eliminates "many" nonzero piS. 

Proof For brevity, denote by Q and Q the feasible set of the LP-relaxation of (jKPp and ()KP-Rp , 

respectively. 

Proof of ([1]) To show (j6.10p . fix j < k; we will prove pj = 0. 

Since A was computed by a BR algorithm with reduction factor Cn, Lemma [3] implies 

Hi; J II < c„(||r||+l)afc. (6.14) 

To get a contradiction, suppose pj ^ 0. Then, since pj is integral, 

M:,ill > \aj\ 

= \pjM + rj\ (6.15) 

> M-\rj\. 

Hence 

M < \\A.,j\\+\rj\ 

< ||i:j|| + ||r||||C/:j||, 

< ||i:j|| + ||r||||i:j||, (6.16) 
= (||r||+l)||A,|| 

< c„(||r|| +l)2afc, 

with the second inequality coming from Cauchy-Schwarz, the third from U-j being a subvector of 
A- J, and the fourth from (j6.14p . Thus, we obtained a contradiction to the choice of M, which 
proves Pj = 0. 

Suppose now that the infeasibility of (|KPp is proven by branching on px. We need to show: 

Vi e Zyi€{k + l,...,n} ^ y^Q. (6.17) 

Let y £ Q- Then 

Uy £ Q =^ pUy ^ Z =^ Pk+Wk+i H ^ PnVn ^ TL ^ yi ^ TL for somei G { A; + 1, . . . ,n}, 

as required. 

Proof of ([2]) The statement (|6.12p follows from (j6.10p . and the obvious fact, that a„_i <||p||, 

since there are n — 1 linearly independent vectors in N(p) with norm bounded by ||p||. 
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To see (j6.13p . we claim 

width(e„,(5) < width(p„e„,(5) 
= width{pU, Q) 
= width(p, Q). 

Indeed, the inequality follows from p„ being a nonzero integer. The first equality comes from 
(j6.12|) . and the second from ([T|) in Theorem [3 The inequalities hold, even if we replace "width" by 
"iwidth", so this proves the second inequality in (j6.13p . 

The claim about the width in the direction of e^ follows from (I6.13p . and Lemma [21 

D 

6.2 Analysis of the AHL-reformulation 

The technique we use to analyse the AHL reformulation is similar, but the bound on M, which is 
necessary for the dominant p direction to turn into a unit direction is different, li (3i = (32 = (i, 
then the AHL reformulation of (jKPp is 



< FA + X/3 <u 
A G 



(KP-N) 



where the matrix 1/ is a basis of N(a) computed by a BR algorithm, and axp = (5. 

Let us write p = pV, f = rV and recall the notation for Ok from ()6.7p . Again we need a lemma. 
Lemma 4. Let A; G { 1, . . . , n — 2 }. Then 

Afc(N(p)nN(r)) < 2\\r\\al^^. (6.18) 

Proof We need to show that there are k linearly independent vectors in N(p) n N(r) with norm 
bounded by 2 || r || a|_,_x- 

Suppose that wi,. . . ,Wk+i are linearly independent vectors in N{p) with norm bounded by 
Ofc+i. Let W = [wi,... ,Wk+i], and 

d = rW eZ^+^ 

Suppose w.l.o.g. that for some t £ {1,...,A; + 1} 

di / 0, . . . , dt / 0, dt+i = ... = 4+1 = 0. 

Then 

d2Wi - diW2, dswi - diws, ... , dtwi - diwt 

are t — 1 linearly independent vectors in N(p) n N(r) with norm bounded by 

2||d||ooafc+i = 2( max \rwi\]ak+i 
\i=i,...,t J 



< 2\\r\\{ max \\wi\\ ]ak+i 

\i=l,...,t II I -r 

< 2||r||a^+i. 
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The k + 1 — t vectors 

Wt+l,... ,Wk+l 

are obviously in N(p)nN(r), with their norm obeying the same bound, and the two groups together 
are hnearly independent. 

D 
Theorem 11. Suppose that p and r are not parallel. Then the following hold: 

(1) Let k < n — 2, and suppose 

M >2cn\\r\\^ al^i. (6.19) 

Then 

Pl:k = 0. (6.20) 

Also, if the infeasibility of liKP\) is proven by branching onpx, then the infeasibility of liKP-M) 
is proven by branching on A^+i, . . . , A„_i. 

(2) Suppose 

M> 2c„ ||rf ||p||2 . (6.21) 

Then 

Pl:n-2 = 0, (6.22) 

and 

width(e„_i, dKElD) < width(p, dlPD) 

iwidth(e„_i, (HEH)) < iwidth(p, dlP])). 

In particular, in the AHL reformulation of (KP-EQ\ ) the width, and the integer width in the 
direction of e„_i are 

e{p/M^). 



Proof First note that pV ^ 0, since aV = 0, pV = imphes rV = 0, hence p and r would be 
parallel. Also, for brevity, denote by Q and Q the feasible set of the LP-relaxation of (jKPp and 
(|KP-N[) . respectively. 

Proof of ([l]) To show (I6.20p . fix j < k; we will prove pj = 0. Suppose to the contrary that 

Pj / 0, then its absolute value is at least 1. Hence 

= \aVj\ = \pjM + rj\ 



(6.24) 

^ ivi — \rj\. 

Therefore 



M < \fj\ 



= \rVj 

< Ik II II ^j II 

< 2cn II rfa^+i. 
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Here the second inequality comes from Cauchy-Schwarz. The third is true, since the columns of V 
are a reduced basis of N(a) C N(p) n N(r), and by using Lemma HI 

Suppose now, that the infeasibility of (jKPh is proven by branching on px. We need to show: 

X^ £ Zyi£{k + l,...,n-l} ^ X^Q. (6.25) 

Let XgQ. Then 

VX + Xf3 e Q =^ p{VX + X13) ^ Z =^ Pfc+lAfe+i H h Pn~lXn-l + pxp Z =^ 

Ai Z for some is{/c + l,...,n — 1}, 
as required. 

Proof of ([2]) The statement (j6.22p again follows from the fact that there are n — 1 linearly 

independent vectors in N(p) with norm bounded by ||p||. 

We will now prove (|6.23p . Since Pn-i is an integer, its absolute value is at least 1. Hence 

width(e„_i,(0) < width(j5„_ie„_i,Q) 
= width(py, Q) 
= w\dth{p,Q), 

with first equality true because of (j6.22p . and the second one due to ^ in Theorem [HI The proof 
of the integer width follows analogously. 

The claim about the width in the direction of e„_i follows from (I6.23|) . and Lemma [2 

D 

6.3 Proof of Theorems [1] and [2] 

Proof of Theorem [1] Recipe 1 requires 

max{r,p,k,u) +kM < Pi < (32 < mm{r,p,k + l,u) + {k + 1)M. (6.26) 

Since now u = e, both max(r,p, /c,n) and min(r,p. A; + l,u) are bounded by || r ||i < y/n \\ r \\ in 
absolute value. So if /3i and /32 satisfy ()1.39p . then they are a possible output of Recipe 1, so the 
infeasibility of the resulting DKP is proven hy px < k\/ px > k + 1. If 

M >2^/^\\r\\+l, (6.27) 

then there is room in (jl.39p for /?i and (32 to be integers. Theorem [3] implies the lower bound on 
the number of nodes that ordinary B&B must enumerate to prove infeasibility. 

On the other hand, ([2]) in Theorem [10] with c„ = y/n implies that if 

M> V^(||r|| +1)2 IIpII, (6.28) 
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then the infeasibihty of the rangespace reformulation is proven by branching on the last variable. 
Finally, the bound on M in (fOSll imphes both (lOTD and (lOHl) . 

D 

Proof of Theorem [2] From the lower bound on M, there is a /? integer that satisfies (|1.4ip , and 
the fact that the resulting instance's infeasibihty is proven hy px < ky px > k + \ follows from 
the correctness of Recipe 2. The lower bound on the number of nodes that ordinary B&B must 
enumerate to prove infeasibihty follows from Theorem [3l 

The fact that the infeasibihty of the AHL reformulation is proven by branching on the last 
variable follows from ([2]) in Theorem 1111 with c„ = ^/n. D 

7 A computational study 

The theoretical part of the paper shows that 

• DKPs with suitably chosen parameters are hard for ordinary B&B, and easy for branching 
on px, just like Examples [TJ [51 and 

• both the rangespace, and AHL reformulations make them easy. The key point is that branch- 
ing on the last few variables in the reformulation simulates the effect of branching on px in 
the original problem. 

We now look at the question whether these results translate into practice. The papers [H [21 [291 E] 
tested the AHL-reformulation on the following instances: 

• In [2], equality constrained knapsacks arising from practical applications. 

• In [Ij, the marketshare problems |15j . 

• In [29], an extension of the marketshare problems. 

• In [3] the instances of ( KP-EQ[ ), with the rhs equal to Frob(a). 



Our tested instances are bounded DKPs both with equality and inequality constraints, and instances 
of dKP-EQp . 



In summary, we found the following. 

(1) On infeasible problems, both reformulations are effective in reducing the solution time of 
proving infeasibihty. 

(2) They are also effective on feasible problems. 

In feasible problems a solution may be found by accident, so it is not clear how to theoretically 
quantify the effect of various branching strategies, or the reformulations on such instances. 
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(3) They are also effective on optimization versions of DKPs. 

(4) When /?i = (32, i.e. both reformulations are applicable, there is no significant difference in 
their performance. 

The calculations are done on a Linux PC with a 3.2 GHz CPU. The MIP solver was CPLEX 9.0. 
For feasibility versions of integer programs, we used the sum of the variables as a dummy objective 
function. The basis reduction computations called the Korkhine-Zolotarev (KZ) subroutines from 
the Number Theory Library (NTL) version 5.4 (see |34j ) . 

We let n = 50, and first generate 10 vectors p, r G Z" with the components of p uniformly 
distributed in [1, 10] and the components of r uniformly distributed in [—10, 10]. We use these ten 
p, r pairs for all families of our instances. 

Recall the notation that for k £ I^, u £ Z"_(_ , 

iQax{r,p,k,u) = Biax{rx\px < k, < x < u},and 
min(r,p, A; + 1, u) = min{ rx |px > A: + 1, < x < n}. 



7.1 Bounded knapsack problems with u = e 

We used Recipe 1 to generate 10 difficult DKPs, with bounds on the variables, as follows: 
For each p,r we let 



and set 



u = e, M = 10000, k = n/2 = 25, a = pM + r, 

Pi = \ina'K{r,p,k,u) + kM~\, 

P2 = [min{r,p,k + l,u) + {k + l)M\. 

By the choice of the data /3i < P2 holds in all cases. We considered the following problems using 
these a, u, /3i, /32 : 



The basic infeasible knapsack problem: 



Pi < ax < 132 
< X <u (DKP-INFEAS) 

X G Z". 



The optimization version: 



max ax 

s.t. ax < 02 

< X < n 

X €Z". 
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(DKP-OPT) 



We denote by /3a the optimal value, and will use Pa for creating further instances. 

• The feasibility problem, with the rhs equal to Pa'- 

ax = Pa 
< X <u (DKP-FEAS-MAX) 

X G Z". 

• The feasibility problem, with the rhs set to make it infeasible: 

ax = Pa + l 
< X < -u (DKP-INFEAS-MIN) 

X G Z". 



On the last two families both reformulations are applicable. 

The results are in Table [H In the columns marked 'R', and 'N' we display the number of B&B 
nodes taken by CPLEX after rangespace and AHL-reformulation was applied, respectively. In the 
columns marked 'ORIG' we show the number of B&B nodes taken by CPLEX on the original 
formulation. 

Since the LP subproblems of these instances are easy to solve, we feel that the number of 
B&B nodes is a better way of comparing the performance of the MIP solver with and without the 
reformulation. 

We also verified that providing px as a branching direction in the original formulation makes 
these problems easy. We ran CPLEX on the original instances, after adding a new variable z, and 
the equation z = px, to the formulation. The results with this option are essentially the same as 
the results in the 'R' and 'N' columns. 



7.2 Bounded knapsack problems with u = lOe 

We repeated the above experiment with u = lOe, but all other settings the same. That is, using 
the same ten p, r pairs, we let 

u = lOe, M = 10000, k = n/2 = 25, a = pM + r, 

and set 

Pi = [max(r,p. A;, u) + A;M] , 

p2 = [m.in{r,p,k + l,u) + {k + l)M\, 



then solved the instances (|DKP-INFEASI) and (IDKP-OPTI) . (|DKP-FEAS-MAXI) and (|DKP-INFEAS-MINp 
as before. The results are in Table [2j The original formulations turned out to be more difficult 
now, whereas the reformulated problems were just as easy as in the u = e case. 
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CO 





RHS values 


(IDXP-INFEASP 


(|DKP-()PTp 


(IDKP-FEAS-MAXp 


(IDXP-INFEAS-MINP 


Ins 


fix 


/32 


Pa 


R 


ORIG 


R 


ORIG 


R 


N 


ORIG 


R 


N 


ORIG 


1 


250040 


259972 


250039 




4330785 


10 


7360728 


10 


1 


1219304 


1 


1 


3181671 


2 


250044 


259979 


250043 




2138598 


10 


2329217 


1 


1 


24130 


1 


1 


1880980 


3 


250069 


259973 


250068 




12480272 


20 


14006843 


10 


3 


13800 


1 


1 


11993912 * 


4 


250034 


259961 


250033 




1454260 


10 


2800898 


1 


5 


555144 


1 


1 


2531222 


5 


250037 


259975 


250036 




4811440 


10 


6715586 


1 


10 


155670 


1 


1 


4131652 


6 


250038 


259981 


250037 




3239982 


10 


2659752 


10 


10 


283776 


1 


1 


3155522 


7 


250085 


259948 


250084 




11579118 


10 


14598901 


10 


1 


107170 


1 


1 


10871441 * 


8 


250052 


259961 


250051 




8659516 


10 


15440957 


10 


1 


486255 


1 


1 


8097370 


9 


250045 


259984 


250044 




6393700 


20 


12520666 


1 


10 


82455 


1 


1 


6346153 


10 


250061 


259968 


250060 




12244168 


10 


14848327 


10 


1 


3600 


1 


1 


11929161 * 


Table 1: DKPs with n = 50, A; = 25, n = e, M = 10000. '*': 1 hour time limit exceeded 




RHS values 


(IDXP-INFEASP 


(|DKP-()PTp 


(ID ^P-FEAS-MAXp 


(|DKP-INFEAS-MINp 




Ins 


13' 


/? 


da 


R 


ORIG 


R 


ORIG 


R 


N 


ORIG 


R 


N 


ORIG 


1 


250083 


259719 


250082 




13204411 


1 


12927001 






2571521 


1 




11968829 * 


2 


250111 


259779 


250110 




13674751 


1 


13369911 






12441612 * 


1 




11968829 * 


3 


250156 


259729 


250155 




10939735 


1 


13737652 






1702224 


1 




10342918 * 


4 


250098 


259619 


250097 




14678404 


1 


12762803 






25917 


1 




13480436 * 


5 


250059 


259759 


250058 




14128736 


1 


13464255 






5829029 


1 




13070602 * 


6 


250051 


259799 


250050 




13979145 


10 


12310057 






597113 


1 




13211779 * 


7 


250206 


259489 


250205 




8895772 


10 


8725886 




10 


10046297 * 


1 




13211779 * 


8 


250111 


259619 


250110 




13198252 


1 


13799370 






12235292 * 


1 




13211779 * 


9 


250081 


259849 


250080 




13136603 


10 


13082057 






18687 


1 




12448850 * 


10 


250206 


259689 


250205 




9251523 


10 


12947576 






9692170 * 


1 




12448850 * 



Table 2: DKPs with n=50, k=25, u=10, M=10000. '*': 1 hour time limit exceeded 



7.3 Equality constrained, unbounded knapsack problems 



In this section we consider instances of the type 



ax = (3 
X > 

X gZ" 



(KP-EQ) 



We recah the following facts: 



If we choose M sufficiently large, and a (3 integer satisfying 



0< 



M + gi-1 
Qn -qi 



1 j (M + (?,) < /? < 



M + gi-1 

qn -qi 



{M + qi) 



then the infeasibility of ( KP-EQ ) is proven by branching on px 



(7.1) 



If /5* is the largest integer satisfying (17. ip . and Frob(a) the Frobenius number of a, then 
clearly /?* < Frob(a). 

Finding /?* is trivial, while computing Frob(a) requires solving a sequence of integer programs. 



We generated 20 instances as follows: using the same p, r pairs as in the previous experiments, 
we let 

M = 10000. 



Then the first instance with a fixed p, r pair arises by letting the rhs in (KP-EQ) be /3*, and the 
second by letting it to be equal to Frob(a). 



The ( KP-EQ] ) instances with fi = Frob(a) were already considered in [3]. 



Our computational results are in Table El Where we go further than [3] is by showing the 
following. 



(3* and Frob(a) are not too different, and neither is the difficulty of (KP-EQ) with these two 
different rhs values. 

Now both reformulations can be applied, and their performance is similar. 



According to Lemma [21 width(p, ( |KP-EQ] )) and iwidth(p, ( |KP-EQl )) both should be small 
compared to the width in unit directions, even when the infeasibility of ( |KP-EQ] ) is not proven 
by branching on px. This is indeed the case when j3 = Frob(a), and we list iwidth(|?, ( |KP-EQ| )) 
in Table [3l as well. 

In the column "px" we list the number of B&B nodes necessary to solve the problems, when 
the variable z, and the equation z = px, is added to the original problems. The results are 
similar to the ones obtained with the reformulations. 
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RHS, width 






ax = 


= 13* 




ax 


= Prob(a) 


# 


P* 


Frob(a) 


iwidth(p) 


R 


N 


px 


ORIG 


R 


N 


px 


ORIG 


1 


7683078 


7703088 








1 


7110020 t 


5 


1 


13 


7320060 t 


2 


8683916 


8703917 








1 


6997704 t 


8 


3 


15 


7123300 t 


3 


8325834 


8345840 








1 


15383299 f 


8 


1 


19 


15313074 t 


4 


10347239 


10367238 








1 


10053497 t 


14 


24 


18 


9928134 t 


5 


16655001 


16665004 








1 


9254836 f 


33 


19 


57 


7519023 t 


6 


9081818 


9121828 








1 


6802797 t 


21 


1 


45 


7011946 t 


7 


6245624 


6245632 








1 


7180978 t 


1 


1 


67 


7151382 t 


8 


10514739 


10534740 








1 


7164967 t 


1 


1 


20 


7178052 t 


9 


14275715 


14285716 








1 


7319379 t 


1 


1 


11 


7368436 f 


10 


9838851 


9838851 









1 


7520143 t 


1 


1 


1 


7230420 t 



Table 3: n=50, M=10000. 'f: 1 hour time limit exceeded 

7.4 Reformulated problems with basic MIP settings 

To confirm the easiness of the reformulated instances we reran all of them with the most basic 
CPLEX settings: no cuts, no aggregator, no presolve, and node selection set to depth first search. 
All instances finished within a hundred nodes. 

The instances and parameter files are publicly available from |17] . 



8 Comments on the analysis in |[3| 



In [3], Aardal and Lenstra studied the instances (KP-EQ) with the constraint vector a decomposing 
as 

a = pM + r, (8.1) 

with p G Z"^,r E Z", M a positive integer, under Assumption [TJ Recall that the reformulation 
p.4|) is constructed so that the columns of B form an LLL-reduced basis of N(a). 

Denoting the last column of B by 6„_i, Theorem 4 in [3] proves ()1.9p . which we recall here: 

lloll 



bn-l\\> 



VW\ 



-{pr 



T\2' 



and the following claims are made: 



It can be assumed without loss of generality, that the columns of B are ordered in a way that 
the first n — 2 form a basis for N{p) D N(r). This claim is used in the proof of Theorem 4. 

Denoting by Q the feasible set of the LP-relaxation of (jl.4p . 6n-i being long implies that 
iwidth(e„_i, Q) is small. 
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To reconcile the notation with that of [3], we remark that in the latter Lq and Lq are used, where 

Lq = N(a),and 



Lc 



N(p)nN(r). 



Here we provide Example [U in which p, r, M satisfy Assumption [H i? is an LLL-reduced basis 
of N(a), but pB has 2 nonzero components, so the first claim does not hold. In Example [T0| 
using a modification of a construction of Kannan in [25 we show a bounded polyhedron where 
the columns of the constraint matrix are LLL-reduced, but branching on a variable corresponding 
to the longest column produces exponentially many nodes. (Note that the polyhedron in [3] is 
unbounded.) Finally, in Remark [9] we clarify the connection with our results. 



Example 9. Let n = 6, 



P 

r 

M 

a 



( 1, 1, 3, 3, 3, 3), 

(-7,-4,-11,-6,-5,-1), 

24, 

(17, 20, 61, 66, 67, 71), 



B 



1 





-3 


1 





2 


-1 


-1 


-1 





-1 


-2 








-1 











1 


2 


-1 








-2 





1 


2 


1 


1 


-1 



The columns of B form an LLL-reduced basis of N(a). They form a basis, since with 

^ = (0,-3,1,0,0,0)^, 
the matrix [B, v] is unimodular, and 

a[B,v] = [Oix(„-i), gcd(a)]. 
LLL-reducedness is straightforward to check using the definition. But 

pB = (0, -1, -1, 0, 0), 
so we cannot choose n — 2 = 4 columns of B which would form a basis of N(p) Pi N(r). 
Example 10. Let p be a real number in (v3/2, 1), and define the columns of the matrix B G 



as 



h 

bs 



(P°,0,...,0)^ 

(p72,pi,...,0,0)^ 

(pO/2,pV2,P^...,0,0)^ 



(8.3) 



Consider the polyhedron 



ipV2,p'/2,py2,...,p^-y2,p^-^)^. 

Q = {A|0< BA< e„}. 
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Proposition 1. The following hold: 

(1) The columns of B are an LLL-reduced basis of the lattice that they generate. 

(2) bn is the longest among the bi. 

(3) width(e„, Q) > c" || 5„ || for some c > 1. 



Proof We have 

b* = p'~^ei (i = l,...,n), 

and when writing bi = Yl]=i l^ijb*ji 

/ii,i-i = 1/2. (8.4) 

Thus (jl.32p in the definition of LLL-reducedness becomes 

||6*||>-^||6*_i||, (8.5) 

which follows from p > l/v2. Since 

Hi, l|2 III, ||2 , 2(n-2) / 2 3\ /o c\ 

this implies ([2]). By the definition of Gram-Schmidt orthogonalization, for any A^ we can always 
set the other Aj so 

B\ = A„6* = \nP^~ e„, 

so 

2 

width(e„,(3) > — — j-. 

and ([3]) follows, since p < 1, and || 6n ||< "- + 1. 

We can make B integral and still have ([1]) through ([3]) hold, by scaling, and rounding it. D 

Remark 9. Our Theorem 1111 proves that if M > 2'"^^^'^ || r \^\\p |p, and the reformulation is 
computed using LLL-reduction, then (p-B)i:(„_2) = 0, and from this it does follow that the first 
n — 2 columns of B form a basis of N(p) n N(r). 

Theorem 1 1 1 1 then finishes the analysis in a different way, by directly proving small width, namely 
showing 

width(e„_i, dEEH)) < width(p, dlP])), and 
iwidth(e„_i, (HEH)) < iwidth(i3, dKEl)). 
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